First Story Detection using Entities and Relations
نویسندگان
چکیده
News portals, such as Yahoo News or Google News, collect large amounts of documents from a variety of sources on a daily basis. Only a small portion of these documents can be selected and displayed on the homepage. Thus, there is a strong preference for major, recent events. In this work, we propose a scalable and accurate First Story Detection (FSD) pipeline that identifies fresh news. In comparison to other FSD systems, our method relies on relation extraction methods exploiting entities and their relations. We evaluate our pipeline using two distinct datasets from Yahoo News and Google News. Experimental results demonstrate that our method improves over the state-of-the-art systems on both datasets with constant space and time requirements.
منابع مشابه
IREvent2Story: A Novel Mediation Ontology and Narrative Generation
Event detection is a key aspect of story development which is composed of multiple narrative layers. Most of the narratives are template-based and follow a narration theory. In this paper, we demonstrate a narrative from events detected in the international relations domain along with classification of events using our novel mediation ontology. We also introduce a novel method of classifying ev...
متن کاملStory Link Detection Based on Event Words
In this paper, we propose an event words based method for story link detection. Different from previous studies, we use time and places to label nouns and named entities, the featured nouns/named entities are called event words. In our approach, a document is represented by five dimensions including nouns/named entities, time featured nouns/named entities, place featured nouns/named entities, t...
متن کاملRhetorical Structure Analysis of EFLs’ Written Narratives of a Picture Story
This study was set to reveal how second language learners use rhetorical relations in their written narratives in terms of Rhetorical Structure Theory (RST) primarily proposed by Mann & Thompson (1987) and developed by Mann, Matthiessen & Thompson (1992). To this end, sixty written narratives based on the picture story book ‘Frog, where are you?’ were collected from EFL learners and were put to...
متن کاملTWO-STAGE METHOD FOR DAMAGE LOCALIZATION AND QUANTIFICATION IN HIGH-RISE SHEAR FRAMES BASED ON THE FIRST MODE SHAPE SLOPE
In this paper, a two-stage method for damage detection and estimation in tall shear frames is presented. This method is based on the first mode shape of a shear frame. We demonstrate that the first mode shape slope is very sensitive to the story stiffness. Thus, at the first stage, by using the grey system theory on the first mode shape slope, damage locations are identified in shear frames. Da...
متن کاملCharacter Profiling in 19th Century Fiction
This paper describes the way in which personal relationships between main characters in 19 century Swedish prose fiction can be identified using information guided by named entities, provided by a entity recognition system adapted to the 19 century Swedish language characteristics. Interpersonal relation extraction is based on the context between two relevant, identified person entities. The re...
متن کامل